An Empirical Study of Cost-sensitive Classification in Campaign Management

نویسندگان

  • Ying Lu
  • Atish P. Sinha
  • Huimin Zhao
  • Thomas Martin
چکیده

Extremely unbalanced data and unequal costs are key challenges in data mining for campaign management in CRM. This paper presents an empirical study of cost-sensitive classification in a real-world campaign analysis in the newspaper industry, where the response rate is extremely low while the two types of misclassification costs are very different. Incorporating cost information provided by domain experts, the authors investigate the use of three classification methods, i.e., naive Bayes, logistic regression, and decision tree, for campaign response prediction. The data are analyzed under various estimated cost matrices. Cross-validated evaluation results show that cost-sensitive classification techniques are capable of obtaining both higher expected response rates and higher expected profits than the current practice when the estimated misclassification costs are moderately unequal. This empirical study provides useful insights for practice in campaign management, as well as research in classification using extremely unbalanced data and unequal costs.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Proposing a Novel Cost Sensitive Imbalanced Classification Method based on Hybrid of New Fuzzy Cost Assigning Approaches, Fuzzy Clustering and Evolutionary Algorithms

In this paper, a new hybrid methodology is introduced to design a cost-sensitive fuzzy rule-based classification system. A novel cost metric is proposed based on the combination of three different concepts: Entropy, Gini index and DKM criterion. In order to calculate the effective cost of patterns, a hybrid of fuzzy c-means clustering and particle swarm optimization algorithm is utilized. This ...

متن کامل

A New Formulation for Cost-Sensitive Two Group Support Vector Machine with Multiple Error Rate

Support vector machine (SVM) is a popular classification technique which classifies data using a max-margin separator hyperplane. The normal vector and bias of the mentioned hyperplane is determined by solving a quadratic model implies that SVM training confronts by an optimization problem. Among of the extensions of SVM, cost-sensitive scheme refers to a model with multiple costs which conside...

متن کامل

Top Management Perception towards Training: An Empirical Study of Travel Agencies in India

The need to develop the required human resources in the tourism industry has become imperative as a consequence of rapidly changing technology and dynamic changes in the international tourism market. And in this regard the role and commitment of the top management towards developing the human resources cannot be undermined. A study has been conducted to assess the perception of the top manageme...

متن کامل

Calculating benefits received from Business Process Outsourcing (BPO): An empirical study of a food industry company in Iran

Today, outsourcing is recognized as one of the most effective strategies in the business world. In this regard, outsourcing of business processes is considered to be one of the most common forms of outsourcing. The purpose of this study is to provide In-depth and quantitative analysis of the benefits of BPO in a dairy plant in Iran and how these benefits affect the willingness of senior plant m...

متن کامل

Ensemble Classification and Extended Feature Selection for Credit Card Fraud Detection

Due to the rise of technology, the possibility of fraud in different areas such as banking has been increased. Credit card fraud is a crucial problem in banking and its danger is over increasing. This paper proposes an advanced data mining method, considering both feature selection and decision cost for accuracy enhancement of credit card fraud detection. After selecting the best and most effec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004